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SECTION  I 
INTRODUCTION 

The  purpose  of  this  study  is  to  investigate  the  feasibility  of 
applying  a deflection  detector  to  short-period  seismic  events  recorded  at  the 
Korean  Seismic  Research  Station  (KSRS).  Another  goal  is  to  design  detector 
algorithms  which  operate  on  a variety  of  signal  spectra  with  better  operating 
characteristics  than  would  be  obtained  by  a power  detector.  For  any  detector, 
the  ideal  receiver  will  match  its  input  time  gate  to  the  signal  duration  and  its 
bandwidth  to  the  signal  and  noise  spectra.  However,  when  a variety  of  (pos- 
sibly unknown)  signals  are  being  detected  in  a dynamic  noise  environment, 
compromises  are  necessary.  In  general,  no  one  filter  will  be  optimal  for  a 
large  class  of  signals.  The  deflection  detector  represents  an  attempt  to  adapt 
to  such  a situation  by  utilizing  the  individual  frequency  cells  of  an  FFT  (Fast 
Fourier  Transform)  as  a bank  of  filters  which  can  accommodate  signals  with 
widely  differing  spectra.  At  each  time  gate,  the  discrete  spectral  power  is 
measured  from  the  lowest  frequency  of  interest  to  the  highest  frequency  of 
interest.  These  spectral  powers  comprise  the  time  series  of  spectral  data 
from  which  detection  statistics  will,  be  derived. 

The  details  of  the  detector  design  are  described  in  Section  II. 

A statistical  analysis  of  the  detector's  performance  with  seismic  noise  is 
given  in  Section  M.  Receiver  Operating  Characteristics  (ROC)  are  presented 
in  Section  IV  based  on  detector  trials  on  four  representative  KSRS  signals  with 
seismic  noise  added  to  their  waveforms.  Conclusions  and  suggestions  for  fur- 
ther study  are  given  in  Section  V. 
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SECTION  II 


THEORY 


A.  DEFLECTION  DETECTOR 

The  deflection  processor  is  a frequency  domain  detection  tech- 
nique. It  detects  large  power  deflections  measured  in  individual  FFT  (Fast 
Fourier  Transform)  cells. 

A moving  time  window  of  2N  points  of  seismic  data  is  input 
to  the  detector  and  Fourier  transformed.  The  power  spectrum,  or  in  some 
cases  the  log  power  spectrum,  is  found  at  N discrete  points  in  the  usual  way. 
The  power  or  log  power  at  the  k^  frequency,  where  k ranges  from  1 to  N, 
is  normalized  by  subtracting  its  mean  value  and  dividing  the  result  by  its 
standard  deviation.  Thus  for  the  i^  segment  of  data  we  have 


Z.(k) 


P.(k)  - n(k) 
o(kj 


(II- 1) 


th  th 

where  P^(k)  is  the  power  at  the  k frequency  for  the  i time  segment, 
ti(k)  is  the  mean  of  the  power  at  the  k**1  frequency,  and  a(k)  is  its  stand- 
ard deviation.  Regarding  k as  fixed  for  the  moment,  we  can  think  of  Z.(k) 
as  a random  function  of  the  variable  i , with  zero  mean  and  unit  standard 
deviation.  There  are  a total  of  N such  variables,  one  for  each  frequency 
index  k . 


At  each  value  of  i , i.  e.  , for  each  input  segment,  we  choose 


the  largest  value  of  the  set  Z^(k),  where  k is  now  varied  to  find  the  maxi- 
mum over  the  frequency  range  of  interest.  It  is  legitimate  to  make  this 
comparison  over  frequency,  since  each  power  has  been  normalized.  If  this 


maximum  exceeds  some  preset  value,  called  the  detection  threshold,  a detec- 
tion is  declared. 


The  parameters  M(k)  and  cr (k ) must  be  estimated  from  signal- 
free  sections  of  data.  Finding  the  best  way  to  do  this  is  one  of  the  important 
problems  of  detector  design,  see  Appendix  A. 

Another  detector,  called  the  average  deflection  power  detector, 
is  also  evaluated  here.  It  is  denoted  at  the  i segment  and  is  derived 
from  the  deflection  detector  through 


X.  = 

1 


signal 

band 


Z.(k). 

1 


(II- 2) 


These  detectors  are  compared  to  one  operating  in  the  time 
domain  and  described  by  Swindell  and  Snell  (1977).  There  the  data  are  pre- 
filtered and  the  average  power  computed  over  a suitable  time  segment.  This 
power  is  normalized  by  subtracting  the  mean  power  and  dividing  by  the  stand- 
ard deviation,  and  the  resulting  quantity  compared  to  a threshold.  In  the  pre- 
sent study  this  form  of  detector  was  implemented  in  the  frequency  domain 
through  the  equations 


|N At  signal  Pi(k)j  " M 
band 


(II- 3) 


Here  At  is  the  sampling  period  of  the  data,  so  by  Parseval's  theorem  the 

th. 

bracketed  quantity  is  the  average  power  over  the  i data  segment.  The 
mean  and  standard  deviation  of  the  average  power  are  [i  and  CT  , respective- 
ly, which  must  be  computed  from  signal-free  data. 

While  the  variables  X.,  Y.,  and  Z.  have  unit  standard  devia- 

ii  l 

tion  and  zero  mean,  all  other  details  of  their  distribution  are  unknown  for  the 
moment. 


B. 


PROCESSING  TECHNIQUES 


1 . Normalization 

The  comparison  of  different  statistical  distributions  is  basic 
to  the  concept  of  the  deflection  processor.  As  described  in  the  previous  sub- 
section, its  fundamental  assumption  is  that  the  mean  and  standard  deviation 
of  the  noise  are  sufficient  to  determine  the  detectability  of  the  signal.  Actu- 
ally, this  is  strictly  true  only  when  the  power  distributions  of  the  individual 
FFT  cells,  denoted  P^  for  noise  alone  and  Pg+N  for  signal-plus-noise, 
possess  the  properties 

(i)  P is  determined  by  its  first  two  moments,  H and  CT,  where 

N 

/iand  a may  be  independently  specified. 

(ii)  ^S+N^X)  = ^N^X"C)  w^ere  c is  a constant;  i.  e.  , the  presence 
of  a signal  merely  translates  the  probability  distribution  a 
fixed  distance,  c. 

An  examination  of  the  noise  distributions  for  the  seismic  data 
under  consideration  (see  Section  El)  revealed  that  the  logarithm  of  the  spec- 
trum produced  nearly  normal  distributions.  For  convenience  in  later  calcu- 
lations it  was  decided  to  perform  a logarithmic  transformation  on  the  spectral 
power,  P(k), 

P'(k)  = log  P(k)  . (H-4) 


Note  that  this  implies  that  the  mean  M'(k)  and  standard  deviation  cy'(k)  of  the 
noise  are  computed  after  taking  the  logarithm  of  the  spectrum: 


For  purposes  of  comparison,  the  detector  is  also  evaluated  without  logarithmic 
transformation. 


2.  Coherent  Versus  Incoherent  Gain 


Consider  a particular  signal  spectrum  S(k)  to  be  detected  in 
noise  having  a mean  spectrum  ji(k)  and  a standard  deviation  a(k).  The  most 
probable  frequency  cell  k for  detection  of  the  signal  by  the  deflection  proces- 
sor is  the  one  for  which  Z(k)  is  a maximum.  In  practice,  it  was  found  that 
this  condition  was  satisfied  in  most  cases  by  a single  frequency  cell  of  the 
32-point  FFT  transform  (see  Section  IV).  Thus,  the  deflection  detector  is 
approximately  equivalent  to  that  pictured  in  Figure  II-l(a).  Similar  diagrams 
are  given  for  the  deflection-power  detection  and  the  conventional  power  detec- 
tor in  Figures  II- 1(b)  and  II- 1(c). 


An  examination  of  Figure  II- 1 reveals  that  the  deflection  pro- 
cessor provides  two  types  of  gain.  The  first,  commonly  referred  to  as  co- 
herent gain  (CG),  is  the  result  of  maximizing  Z.(k)  and  is  given  (in  decibels) 
by 


CGd 


1 0 lo§i  o 


(signal/noise)cen  A 
(Signal/noi8e)s.Bnalband 


= 10  lo*l0 


S(k)  III  (k) 

£ S(k)  / £ M(k) 

signal  ' signal 

band  band 


(II- 6) 


(Note  that  for  some  spectra  this  may  be  less  than  zero  dB;  i.  e.  , a loss.  ) The 
second  gain,  so-called  incoherent  gain,  is  the  reduction  of  noise  standard 
deviation  obtained  when  the  ir/iximum  of  a set  of  N random  variables  is  it- 
self treated  as  a random  variable.  In  the  present  case,  the  logarithms  of 

th 

the  normalized  noise  powers  in  the  i data  segment  have  an  (approximately) 

A 

Gaussian  distribution,  but  the  distribution  of  their  maxima  Z(k)  is  given  by 


! 


(c)  Power 


FIGURE  II- 1 

ABBREVIATED  MODELS  OF  THREE  DETECTORS 


P(Z(k)) 


I 

J 

-«o 


G(x)dx 


G(Z(k)) 


(H-7) 


» / 

where  G(x)  is  the  Gaussian  probability  density.  The  standard  deviation  of  G 
is  difficult  to  find  analytically,  but  may  be  obtained  in  a straightforward  man- 
ner using  Monte  Carlo  techniques.  Denoting  this  standard  deviation  by  <x  , 
we  have  for  the  incoherent  gain 

IGd  = 10  log  (1  / <rz)  . (II- 8) 

This  quantity  is  plotted  as  a function  of  N in  Figure  II- 2. 

Also  shown  in  Figure  II- 2 is  the  incoherent  gain  of  the  deflection 
power  detector  and  the  power  detector,  both  of  which  are  equal  to 

IGp  = 10  log  VTT  . (Et-9) 

This  gain  is  due  to  reduction  of  the  standard  deviation  of  the  noise  power 
through  averaging  over  N points,  whether  they  be  in  the  frequency  or  time 
domain. 

One  would  expect  the  total  gain  to  reflect  the  performance  of 
the  detectors.  Strictly  speaking,  this  is  true  only  in  the  case  where 
^S+N^  = ^j(x_c)  ' Furthermore»  the  incoherent  gains  for  the  deflection 
detector  and  deflection-power  detector  are  those  indicated  only  when  the  dis- 
tributions of  the  individual  frequency  cells  are  identical  (e.  g.  , flat  signal  and 
flat  noise  spectrum)  and  Gaussian.  Thus,  in  general,  equations  (II- 5 ) and 
(II-6)  provide  only  a guideline  to  the  relative  effectiveness  of  the  three  detec- 
tors. The  degree  of  reliance  that  one  may  place  on  these  indicators  will  be 
treated  in  Section  IV. 


3. 


Effects  of  Parameters  on  Detectability 


The  first  major  design  parameter  encountered  is  the  length  of 

the  processing  gate  T.  It  must  be  matched  to  the  expected  signal  duration 

and  implemented  such  that  the  processing  gain  is  maximized  and  the  false 

alarm  rate  is  minimized.  The  number  of  false  alarms  per  hour  R , the 

F 

probability  of  false  alarm  P , and  T are  related  by 

F 


Increasing  T can 

(i)  provide  coherent  gain  by  increasing  spectral  resolution, 

(ii)  provide  incoherent  gain  by  averaging  over  more  points,  and 

(iii)  permit  a higher  probability  of  false  alarm  P for  a given 

F 

false  alarm  rate  R (equation  (II- 10)), 

F 

but  may  also 

(iv)  decrease  the  signal-to-noise  ratio  by  including  noise  where 
the  signal  is  absent. 

Thus,  T should  be  chosen  as  large  as  possible  without  incurring  the  penalty 
(iv). 

Assuming  that  an  appropriate  T has  been  chosen  for  the  va- 
riety of  signals  to  be  detected,  a subdivision  into  FFT  gates  may  be  made. 

It  is  advisable  to  overlap  processing  gates  in  order  to  match  the  signal  start 
time  as  closely  as  possible.  Increasing  the  number  of  overlapping  segments 
M trades  off  a possible  coherent  gain  (i.  e.  , decreases  FFT  resolution)  for 
an  incoherent  gain  coupled  with  a better  chance  of  matching  the  start  of  the 
signal.  Overlap  may  also  be  obtained  without  increasing  M,  but  at  the  ex- 
pense of  performing  more  FFT  transforms.  There  is  no  change  in  P be- 
cause,  although  the  number  of  outputs  are  increased,  they  are  statistically 


dependent,  and  the  inclusion  of  an  M- output- gate  dead  time  after  a detection 
prevents  a doubling  of  the  alarm  rate. 

Another  parameter  of  importance  is  the  prefilter.  For  a given 
variety  of  signals  and  noise  spectra  it  is  obviously  advantageous  to  filter 
out  those  portions  of  the  spectrum  which  consist  primarily  of  noise.  An  ex- 
treme example  is  presented  when  the  signal  to  be  detected  and  the  noise  sta- 
tistics are  completely  known.  In  that  case,  for  a given  P , there  exists  a 
filter  which  will  optimize  the  probability  of  detection  for  the  power  de- 

tector. Since,  for  a single  signal,  the  deflection  detector  is  approximately 
a power  detector  with  a narrowband  prefilter,  it  would  in  general  perform 
more  poorly  than  such  an  optimum  filter  (i.  e.  , the  narrowband  filter  would 
not  be  optimal).  If,  however,  the  variety  of  signals  to  be  detected  is  large, 
a single  optimum  filter  is  not  possible  since  one  does  not  wish  to  filter  one 
signal  at  the  expense  of  another,  and  a likely  choice  for  the  prefilter  is  a 
simple  bandpass  filter.  It  should  be  noted  that  a specific  advantage  of  the 
deflection  processor  is  that  it  provides  a bank  of  filters  (the  FFT  cells)  as 
opposed  to  a single  filter  which  may  be  utilized  in  detecting  a broad  variety 
of  signals. 

Finally,  let  us  consider  the  size  of  the  time  constant  used  in 
updating  estimates  of  the  noise  statistics.  If,  as  in  the  present  study,  the 
background  noise  is  only  quasi- stationary,  the  noise  statistics  must  be  peri- 
odically updated.  When  such  an  update  involves  a time  average,  the  averag- 
ing time  should  be  chosen  short  enough  to  insure  short-term  stationarity  and 
yet  long  enough  to  provide  a reliable  estimate.  This  is  extremely  important 
because  the  results  will  directly  affect  the  choice  of  a detection  threshold. 
While  a detector's  ROC  curves  (P  versus  P ) are  independent  of  any  choice 
of  threshold,  the  actual  performance  of  the  detector  is  not.  If  a threshold 
(possibly  time  dependent)  cannot  be  set  to  provide  a desired  false  alarm  rate, 
then  even  a superior  detector  may  be  impractical  to  implement  or  may  per- 
form poorly. 
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SECTION  ni 
NOISE  STATISTICS 


The  design  of  a detector  requires  an  accurate  signal  model  and 
knowledge  of  the  statistical  properties  of  the  noise  in  which  it  is  embedded. 
Thus,  it  is  imperative  to  examine  the  noise  statistics  as  extensively  as  pos- 
sible. To  do  this  one  must  choose  a homogeneous  ensemble  of  independent 
noise  samples.  If  the  noise  is  not  stationary  (but  only  one  sample  is  available 
at  a given  time),  the  ensemble  must  be  chosen  within  a short  enough  time  pe- 
riod to  insure  approximate  stationarity. 

In  the  present  case,  a continuous  2 hour  40  minute  sample  of 
noise  was  edited  from  a Korean  Seismic  Research  Station  (KSRS)  data  tape 
for  examination  and  beamformed  at  20  km/ sec.  The  96,  000  points  (obtained 
at  a sample  rate  of  10/ sec)  were  divided  into  3,  000  gates  of  3.  2 seconds  each 
corresponding  to  a 32-point  FFT,  and  into  6,000  gates  of  1.  6 seconds,  cor- 
responding to  a 16-point  FFT.  The  gates  were  Fourier  transformed  and  the 
statistics  of  the  noise  power  N(k)  for  each  frequency  cell  k were  studied. 
Tests  indicated  independence  of  the  samples  N.  , i = 1, . . . , 3000  with  95% 
confidence.  The  original  noise  samples,  0.  1 second  apart,  were  not  inde- 
pendent (i.  e.  , their  spectrum  was  not  flat).  In  addition,  application  of  the 
Kolmogorov-Smirnov  two- sample  test  indicated  homogeneity  with  95%  con- 
fidence. 

The  mean  and  standard  deviation  of  the  power  spectrum  and  of 
the  total  power  are  plotted  in  decibels  versus  frequency  in  Figure  III- 1 . 

It  is  well  known  that  the  power  of  seismic  noise  is  approxi- 
mately log-normal  (Freedman,  1 967  ; Swindell  and  Snell,  1977).  This 


III- 1 


32-point  transform 


! 


leads  us  to  the  conjecture  that  the  same  may  be  true  of  narrowband  seismic 
noise  (at  resolutions  of  0.  3 Hz  and  0.  6 Hz).  To  test  this  hypothesis,  the 
logarithms  of  the  noise  spectrum  samples  were  taken  and  the  resulting  en- 

* 

sembles  examined.  The  means  and  standard  deviations  of  these  distributions 
appear  in  Figure  III- 2. 

Goodness- of- fit  tests  were  made  to  test  the  normality  of  the 
log  power  distributions;  the  results  appear  in  Table  IH- 1 . Although  the  dis- 
tributions of  individual  FFT  cells  are  different,  those  above  1.  0 Hz  all  ap- 
peared approximately  normal  at  the  0.  05  critical  level  corresponding  to  the 
test  value.  Note  that  no  single  cell  was  as  close  to  normal  as  the  logarithm 
of  the  total  power  (i.  e.  , broadband  noise).  Examples  of  some  of  the  indivi- 
dual distributions  are  found  in  Figures  III- 3 and  III-4. 

The  above  noise  sample  would  appear  sufficiently  long  to  set  a 
threshold  for  false  alarm  rates  on  the  order  of  two  per  hour  (i.  e.  , that  value 
which  the  detector  output  exceeds  only  six  times  in  the  entire  2 2/3  hour  noise 
sequence).  However,  such  low  alarm  rates  depend  only  on  the  tail  of  the  dis- 
tribution and  as  a result  are  inordinately  sensitive  to  relatively  small  fluctua- 
tions in  the  statistics.  In  other  words,  short-term  noise  variations  will  have 
a noticeable  effect  on  the  threshold.  This  poses  the  following  dilemma:  A 
long  time  period  is  necessary  to  estimate  the  threshold,  but  the  threshold 
must  be  dynamic  over  a much  shorter  period. 

One  possible  solution  (that  adapted  here)  is  to  assume  that  the 
noise  distributions  in  question  depend  to  a 'close'  approximation  only  on  their 
first  two  moments.  These  parameters,  the  mean  and  standard  deviation,  may 
be  estimated  from  a much  smaller  ensemble  than  would  be  required  for  the  ac- 
curate determination  of  the  tail  of  the  distribution.  Once  the  mean  and  stand- 
ard deviation  have  been  found,  a normalization  transforms  the  distribution  to 
a canonical  one,  reasonably  independent  of  the  statistical  fluctuations,  and 
the  appropriate  threshold  may  be  set. 
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FIGURE  III -4 

EMPIRICAL  DISTRIBUTIONS  FOR  500-POINT  ENSEMBLES  FOR 
(a)  0.  6 Hz  (CELL  2)  (b)  2.  0 Hz  (CELL  4)  (c)  3.  2 Hz  (CELL  6) 
AND  (d)  TOTAL  POWER  FOR  A 16-POINT  TRANSFORM 


3 

We  now  have  two  compelling  reasons  for  accurate  on-line  esti- 
mation of  fi1  (k)  and  a'(k);  (1)  they  are  essential  parameters  in  the  receiver's 
operation  in  that  they  are  used  to  normalize  the  power  spectrum,  and  (2)  they 
determine  the  threshold  setting  for  a given  false  alarm  rate.  The  most  ob- 
vious method  for  their  computation,  a running  average,  requires  (number 
samples)  x (number  frequency  cells)  x (number  beams)  words  of  memory  and 
can  easily  become  unwieldly.  Instead,  it  is  advisable  to  use  a leaky  integra- 
tor (also  known  as  RC  or  exponential  averaging;  Horton,  1969), 


M(k) 


MX(k)  = (l-e)M1_1(k)  + fNX 


(HI-1) 


where 


l-e  = 

T = 
T 


-T  It 


N. 


averaging  time 

time  between  samples  (3.  2 sec  or  1. 6 sec) 

?e. 


i^1  time  sample,  k^  frequency  cell  of 


for  estimating  the  moments.  Some  interesting  pitfalls  exist  when  it  is  applied 
to  the  standard  deviation  b(k).  As  has  been  previously  indicated,  the  time 
constant  T must  be  chosen  large  to  give  a reliable  estimate,  but  small  enough 
to  follow  the  dynamics  of  the  noise.  In  the  current  study  € = 0.  025  ( T = 1 min 
to  2 min). 


SECTION  IV 


EVALUATION 


A. 


INTRODUCTION 


The  evaluation  of  a detector  is  not  often  an  easy  task.  For  the 
simple  case  of  a known  signal  in  Gaussian  noise,  the  detectability  depends 
only  on  the  ratio  of  the  signal  power  to  the  standard  deviation  of  the  noise 
which  in  turn  is  proportional  to  the  signal-to-noise  ratio  (Van  Trees,  1968). 
The  greater  this  value,  the  greater  the  probability  of  detection  (P^)  for  a 
given  probability  of  false  alarm  (P  ).  For  more  complex  situations,  a rough 

I 

estimate  of  detectability  is  the  'deflection'  criterion: 


E [f  (s+n)  - f(n)] 
o(i  (n» 


(IV- 1) 


where 

f(x)  = detector  output  for  input  x 

s = signal 

n = noise 

<7(x)  = standard  deviation  of  x 

which  gives  the  'distance'  between  the  output  distributions  for  signal-plus- 
noise  and  for  noise  alone.  By  varying  the  signal  standard  deviation,  one 
can  construct  two  detector  outputs  x and  y such  that  x detects  more  sig- 
nals than  y but  y gives  a larger  deflection  criterion  for  all  P^,.  It  is  also 
possible  for  one  detector  to  be  better  at  low  false  alarm  rates  while  another 
is  better  when  they  are  high.  In  short,  for  most  detectors  it  is  necessary 
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These  curves  depend  on  the  detector,  the  signal  being  detected, 
and  the  noise  statistics.  In  general,  for  a fixed  signal,  an  increase  in  signal 
power  should  provide  higher  detectability;  i.  e.  , all  points  on  the  ROC  curve 
would  move  upwards.  To  determine  ROC  curves  by  compiling  a large  num- 
ber of  seismic  events  is  impossible  since  no  two  signals  will  have  the  same 
shape,  and  estimates  of  their  magnitude  are  approximate.  Thus,  the  follow- 
ing procedure  was  implemented. 

A seismic  event  s(t)  of  duration  T seconds  was  chosen, 
beamformed  and  edited.  Several  thousand  noise  samples  n.(t)  of  the  same 
duration,  T , were  also  edited.  The  detector  was  then  run  separately  on  the 
samples  n.(t)  (noise  alone)  and  as(t)+n^(t)  (signal-plus-noise  at  a fixed  peak 
signal-to-RMS-noise  ratio  dependent  on  a).  For  each  value  of  a , a ROC 
curve  was  computed  from  the  two  output  distributions  (see  Figure  IV- 1 for 
details). 

In  the  present  study,  the  processing  gate  T was  taken  to  be 

3.  2 seconds,  while  FFT  transforms  of  both  16  and  32  points  were  examined. 

In  the  former  case  the  probability  distributions  were  determined  from  6000 

sample  points;  in  the  latter,  from  3000.  Six  ROC  curves  were  computed  for 

values  of  a corresponding  to  signal-to-noise  ratios  of  -6  dB,  -3  dB,  0 dB, 

3 dB,  6 dB,  and  9 dB.  For  each  curve  the  probability  of  detection,  , was 

plotted  for  false  alarm  probabilities  of  P = 0.  001,  0.  002,  0.  004,  0.  008, 

F 

0.  02,  and  0.  04,  corresponding  to  false  alarms  of  1.1,  2.  2,  4.  5,  9.  0,  22.  5, 

and  45.  0 per  hour.  This  was  sufficient  for  comparing  detectors  even  though 

some  lower  rates,  less  than  P = 0.  001,  would  also  be  of  interest.  We  note 

F 

that  for  P = 0.  001  and  only  3000  sample  points,  the  results  are  subject  to 
F 

some  statistical  variation,  and  for  even  smaller  values  a larger  sample  space 
would  certainly  be  necessary. 

Four  representative  short-period  seismic  events  recorded  at 
KSRS  were  beamformed  and  edited  (32  sample  points  at  a rate  of  10  per 
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noise  alone  : n.  N Detector 


CONSTRUCTION  OF  A ROC  CURVE  (COORDINATES  (PF(0),  PD(0))) 
USING  AN  M- POINT  SAMPLE  SPACE 


second)  for  use  as  signals  in  evaluating  the  detector.  These  signals,  referred 
to  as  A,  B,  C,  and  D,  are  documented  in  Table  IV- 1,  and  their  spectra  appear 
in  Figure  IV- 2.  Note  the  presence  of  high  frequency  energy  in  C and  D (pre- 
sumed explosions)  not  found  in  A and  B.  The  three  hours  of  noise  analyzed 
in  Section  III  was  also  used  in  the  simulation.  The  prefilter  consisted  of  a 
bandpass  filter  from  0.  8 Hz  to  3.  6 Hz  (cells  4 to  12  in  a 32-point  FFT  trans- 
form). However,  for  event  C,  the  low  signal-to-noise  ratio  in  the  above  band 
(2.  3 dB)  necessitated  a narrower  prefilter  (1.4  Hz  to  3.  6 Hz)  since  the  model 
signal  could  not  be  accurately  estimated  throughout  the  larger  band.  For  the 
16-point  transforms,  the  corresponding  prefilters  were  0.  93  Hz  to  3.  5 Hz 
(cells  2 to  6)  and  1.  7 Hz  to  3.  5 Hz  (cells  3 to  6). 

B.  COMPARISON  OF  DEFLECTION  DETECTOR  AND  POWER  DETECTOR 

We  now  consider  the  performance  of  the  two  log  power  detectors 
described  by  equations  (II- 1)  and  (II- 2 ).  Before  presenting  the  results,  let  us 
estimate  the  detectors'  relative  behavior  using  equations  (II-6)  through  (II-9) 
and  Figure  H-2.  For  a 3.  2- sec  gate  the  gain  of  the  deflection  processor,  for 
models  A,  B,  C,  and  D documented  in  Table  IV-1,  is: 
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CG,  + 2.  0 - 4.  2 
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CG  , - 2.  7 
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CG,  - 2.  2 
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signals  A,  B,  D 
signal  C 
signals  A,  B,  D 
signal  C. 


(IV-  2) 


Table  IV- 2 contains  the  calculation  of  the  coherent  gain  and  the  numerical 
results. 
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FIGURE  IV- 2 

SPECTRA  OF  FOUR  SIGNALS  AND  THEIR 
SIGNAL- TO- NOISE  RATIOS 
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ROC  curves  for  the  four  signals  at  signal-to-noise  ratios  of 
-6  dB,  -3  dB,  0 dB,  3 dB,  6 dB,  and  9 dB  are  pictured  in  Figure  IV-3.  It 
can  be  seen  that  in  comparison  with  the  power  detector  the  deflection  detector 
performs  better  for  cases  C and  D,  approximately  equally  for  case  A,  and 
more  poorly  in  case  B.  In  order  to  estimate  the  actual  gains  and  compare 
them  with  those  predicted  in  Table  IV-2,  Figure  IV-4  contains  plots  of 
versus  signal-to-noise  ratio  with  P fixed  at  0.  004.  As  seen  in  Table  IV-3, 
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the  actual  gains  found  from  Figure  IV-4  reflect  those  estimated  reasonably 
well,  especially  considering  the  approximate  nature  of  our  reasoning  (i.  e.  , 
the  true  gains  depend  in  a non-linear  manner  on  P , P , and  even  S/N). 
Loosely  speaking,  when  the  signal  is  concentrated  at  low  frequencies,  the 
region  of  highest  noise,  filtering  provides  little  coherent  gain  and  the  power 
detector  is  preferable.  For  those  having  more  high  frequency  energy  (C  and 
D),  the  deflection  detector  is  better.  (For  the  curious  reader,  a typical  set 
of  probability  distributions  (noise  and  signal-plus-noise)  for  S/N  = 0 dB  are 
included  in  Figure  IV-5.  ) 

C.  VARIATIONS 

Detection  curves  (at  P = 0.  004)  comparing  the  effects  on  the 
deflection  detector  of  the  FFT  transform  length,  the  intermediate  logarithmic 
transformation,  and  forming  the  deflection-power  are  found  in  Figures  IV- 6 
and  IV-7.  A brief  analysis  of  these  results  now  follows. 

1.  16- Point  Transform 

Let  us  examine  the  consequences  of  reducing  the  FFT  trans- 
form length  of  the  deflection  detector  from  32  to  16,  According  to  Section 
II-B-3,  this  should  exchange  coherent  gain  for  incoherent  gain  (10  log  V~2 
= 1.5  dB).  In  Table  IV-4,  we  estimate  the  performance  of  the  detector  rela- 
tive to  a 32-point  transform  and  compare  it  with  that  deduced  from  Figure 
IV-6  by  subtracting  S/N  for  the  32-point  transform  from  that  for  the  16-point 


0.0  0.33  0.6?  1.0 


J'°g10FF 

Signal  A 


L°g10PF 
Signal  B 


Signal  C Signal  D 


FIGURE  IV- 3 
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FIGURE  IV-4 

COMPARISON  OF  THE  PERFORMANCE  OF  THE  DEFLECTION 
DETECTOR  (SCLID  LINE)  AND  POWER  DETECTOR  (DASHED  LINE) 
FOR  A FALSE  ALARM  PROBABILITY  OF  P =0.  004  (4.  5 PER  HOUR) 


TABLE  IV- 3 

COMPARISON  OF  ACTUAL  GAINS 

OF  THE  DEFLECTION  DETECTOR  OVER  THE  POWER  DETECTOR 

TO  THOSE  ESTIMATED  (G.-G  ) FOR  FOUR  SIGNALS 
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FIGURE  IV-5 

OUTPUT  PROBABILITY  DISTRIBUTIONS  FOR  INPUTS  OF  NOISE  AND  OF 
SIGNAL- PLUS- NOISE  (AT  S/N  = 0 dB)  FOR 
DEFLECTION  AND  POWER  DETECTORS;  SIGNALS  A,  B,  C,  AND  D 
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-♦  = deflection  with  logarithm 
— • = power  with  logarithm 
♦ = deflection  without  logarithm 
= power  without  logarithm 


FIGURE  IV- 7 

COMPARISON  OF  FOUR  DETECTORS 
FOR  P_  = 0.  004 


transform  at  P = 0.  5.  Note  that  for  the  16-point  transform,  the  deflection 
detector  did  not  always  sharply  distinguish  a single  frequency  cell  (partly  be- 
cause the  simulation  split  the  signal  into  two  16-point  segments)  which  neces- 
sitated listing  more  than  one  k^.  Except  for  one  case,  it  was  advantageous 
to  use  the  longer  FFT  transform, 

2.  Logarithmic  Transformation 

We  recall  that  the  purpose  of  the  logarithmic  transformation, 
equation  (H-4),  was  to  'Gaussianize'  the  noise  and  thus  improve  the  ability 
of  the  deflection  statistic  Z = (P l-^,)/cr,  to  reflect  signal  detectability.  In 
this  instance,  our  intuition  appears  to  have  failed  us.  For  the  signals  and 
noise  examined,  no  such  improvement  was  present.  In  fact,  the  transforma- 
tion produced  a slight  decrease  in  performance.  Typical  results  are  pictured 
in  Figure  IV- 7. 

3.  Deflection  - Power  Detector 

As  may  be  seen  in  Figure  IV- 5,  the  performance  of  the  deflec- 
tion power  detector  (equation  (II-2))  was  uniformly  poorer  than  that  of  the  de- 
flection detector  or  power  detector.  This  was  considered  somewhat  surpris- 
ing, and  it  was  postulated  that  in  a different  situation  (different  signal  and 

2 

noise  spectra)  it  might  perform  better.  In  view  of  the  result  that  S(k)//x  (k) 

is  an  optimal  filter  for  a power  detector,  i.  e.  , for  detection  of  a signal  of 

known  power  spectrum  in  noise  with  a Gaussian  power  spectrum  (Moll,  1974; 

Eckhart,  1972),  it  was  decided  to  create  a synthetic  signal  by  modifying  the 

2 

spectrum  of  event  D giving  it  the  property  l/cx(k)  = S(k)//i  (k).  As  evidenced 
in  Figure  IV-8,  there  was  a dramatic  improvement.  This  does  not,  however, 
change  the  conclusion  of  the  previous  evaluation;  i.  e.  , that  the  deflection- 
power  detector  is  not  suitable  for  short-period  seismic  detection. 
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deflection- power 
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FIGURE  IV- 8 

COMPARISON  OF  DETECTOR  PERFORMANCE  FOR  SYNTHETIC  SIGNAL 

(a)  SIGNAL  AND  SIGNAL- TC  - NOISE  RATIO  SPECTRA 

(b)  ROC  CURVES  FOR  DEFLECTION-POWER  (SOLID) 

AND  POWER  (DASHED)  AT  3 dB  INTERVALS  OF  SNR 

(c)  P„  VERSUS  SIG /NOISE  FOR  P =0.  004 
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SECTION  V 
CONCLUSIONS 


The  deflection  detector  and  the  conventional  power  detector 
shown  on  Figure  II- 1 were  compared  and  evaluated  on  a set  of  four  seismic 
events  buried  in  seismic  noise.  The  detection  performance  of  the  deflection 
processor  was  the  same  for  one  of  the  signals,  showed  a 2 dB  relative  loss 
in  signal-to-noise  ratio  processing  ability  for  a second  signal,  and  a 2 dB 
gain  for  the  two  remaining  signals.  More  generally,  it  was  found  that  the 
deflection  algorithm  is  at  an  advantage  in  detecting  signals  of  enhanced  ener- 
gy content  above  1.  25  Hz.  Such  signals  appear  to  have  narrowband  signal- 
to-noise  spectra  with  seismic  noise  power  concentrated  at  the  lower  frequen- 
cies. This  results  in  greater  coherent  gain. 

The  potential  usefulness  of  the  deflection  detector  is  perhaps 
better  visualized  in  a more  general  setting.  Figure  V-l  illustrates  three 
levels  of  signal  information  and  appropriate  detector  configurations.  Although 
the  deflection  detector  possesses  more  versatility  when  the  variety  of  signal 
spectra  is  large.  For  a single  known  signal,  the  ideal  detector  is  a matched 
filter.  However,  for  the  detection  of  a whole  variety  of  signals  of  diverse 
spectral  content,  the  matched  filter  or  shaped  filter  should  be  replaced  by  a 
bank  of  filters  (one  for  each  signal  type)  and  a rule  for  deciding  between  sig- 
nals. The  deflection  processor  actually  does  this  in  a rudimentary  manner 
with  FFT  cells  used  as  bandpass  filters.  One  might  hope  that  a deflection 
criterion  could  also  work  at  the  second  level  shown  on  Figure  V-l.  Such  a 
detector,  for  a variety  of  three  signal  spectra,  is  diagrammed  in  Figure  V-2. 
Clearly,  a crucial  design  factor  is  the  determination  of  the  number  of  signal 
types  in  the  class  to  be  detected  and  the  modeling  of  those  signals.  Thus,  an 
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evaluation  of  the  usefulness  of  the  deflection  detector  and/or  the  construction 
of  a more  powerful  detector  similar  to  that  of  Figure  QI-2  require  a thorough 
study  of  the  signal  spectra  in  the  region  of  interest  and  of  the  noise  spectra. 
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APPENDIX  A 


In  many  situations,  it  is  desirable  to  obtain  simultaneous  esti- 
mates of  the  means  and  standard  deviations  for  a large  number  of  parameters. 
For  example,  in  broadband  detection  several  hundred  frequency  cells  may  be 
involved.  A running  average  over  a hundred  points  would  require  in  the 
neighborhood  of  20,  000  to  50,  000  words  of  memory.  A common  procedure 
for  circumventing  this  difficulty  is  a weighted  average. 


, , = G - € )|i.+  eY„ 

n + i ~n  n 


(A-l) 


where 


u = estimated  mean  at  sample  n 
n 

y = nth  sample  point 
n 


O < € < 1. 

Equation  (A-l)  is  analogous  to  a RC  circuit  average  with 


1 - € = e 


-I-*  ~1-  ± 
T N 


(A-2) 


where  T is  the  time  between  samples  and  r=  RC  is  the  response  (averaging) 
time  (Horton,  1969),  and  r/T  — N corresponds  to  the  number  of  samples 


averaged. 


A similar  formula, 

- 2 2 

<r  = ( l“  « ) o-  +(«-“-)  (y  - M _) 
w n+1  n 2 n n 


(A- 3) 


for  the  standard  deviation  involves  possible  pitfalls,  which  we  proceed  to 
discuss. 


is  introduced. 


We  note  that  the  extra  factor 
/3-<-  — - «— — 


in  order  that  cr  ^ an  asymptotically  unbiased  estimate  of  the  variance.  It 

corresponds  to  the  well-known  factor  N/N-l  used  in  estimating  the  variance 

2 

for  N samples  (Bendat  and  Piersol,  1966).  More  precisely,  let  n and  <r 
be  the  true  mean  and  variance,  and  define 


/*«=  lim  E(Mn) 

n— * oo 

cr2  = lim  E(cr^),  (A-4) 

co  n 

n— *® 

which  exist  since  the  expectations  of  equations  (A-l)  and  (A- 3)  are  stable 
linear  difference  equations  ( e < 1).  Taking  the  expectation  of  both  sides  of 
equation  (A-l)  gives 


E(/x  ) = (1  - c ) E (/*  ) + «/* 
n+i  n 


so  that 
or 


«M®  = eA* 


A1®  = A1 

The  square  of  equation  (A-l)  gives 


^n+1 


.2  2 


2 2 


= (1  - € ) + 2 c (1  - € ) u_y_  + € y 


Atn7n 


n 


Let 


( ju2)  = lim  E ( uj). 
oo  n 

!!-♦  oo 


(A  - 5) 
(A- 5 a 


(A- 6) 


(A- 7) 


2 2 2 

Then, since  u and  y are  independent  and  E(y  ) = cr  + a , equation  (A-l) 
n n nr 


n 


yields 


(2e  - e 2)  (M2)  = 2 e (1  - c ) \l2  + « 2 ( ^ 2 + fi  2) 


From  equation  (A- 3) 


e<r*=0(„Z+S  - z/  + („2)  ) 


- 13  '°-2  + _rn—  -2> 

= * T~  1 'TTT’  "■  ' 


Thus, 


_ 2 2 

cr  - cr 
eo 


(A- 9) 


We  note  that  for  large  N (i.  e. , N >10),  t is  much  less  than  one,  and  thus 
the  error  incurred  by  replacing  e - — - — in  equation  (A- 3)  by  g is 
negligible. 

Two  other  candidates  for  recursion  formulae  for  the  variance 


?„+i2  = (1-  «)?„2  + <«  +-TTa»  <y„2  - (*»2) 


(A-10) 


n+l  ^n+1  " Mn+1)  (1  + 2(1  - e)  ) 


where 


_ 2 2 
= 1 - e ) x + € y . 

n n 


(A-ll) 


Note:  cr  represents  the  method  of  equation  (A- 3);  a-,  of  (A-10);  and  cr , of  (A-ll). 

...  -2  2 _2 

None  of  these  is  biased,  i.  e.  , (yM  = a oo  = o ; however,  while  the  behavior 
of  equation  (A- 3)  and(A-ll)  is  excellent,  that  of  (A-10)  is  intolerable  (see 


Figure  A-l). 


The  cause  of  the  difficulties  lies  in  the  standard  deviation  of 


the  estimated  variance,  i.  e.  , in 


„ . 2 2 2 

E ( cr  , . , “ rr  ) • (^ 

estimate  & 

The  quantity  in  (A-12)  may  be  computed  for  the  three  recursion  formulae 


(A -12) 


(A-3),  (A-10),  and  (A-ll)  by  substituting  the  appropriate  recursion  formula 


THE  STANDARD  DEVIATION  CN  AN  ENSEMBLE  OF  NORMAL  PSEUDO  RANDOM  NOISE 

(MEAN  = 10,  ST.  DEV.  =1.0) 


in  (A- 12)  and  applying  the  techniques  used  in  equations  (A- 5)  through  (A- 9). 


The  algebra  is  extremely  tedious  involving  the  calculation  of  the  various 
2 1 k 

moments  (E  (a  ..  . U );  l - 0,  1,  2,  and  k = 0,  . . . , 4)  and  thus  only 

PCTItqyiP  n ' 


' estimate  n ’ ' * ’ ' 

the  final  expressions  will  be  presented.  To  the  first  order  in  £ , we  have 


EiO"  - a ) 


*■«£ 


G - a 

2 


(A- 13) 


+ Z/jB  + O 


(A  - 1 4 ) 


„-2  2 2 

) 


G - a4  1 

=f  h — j 


(A- 15) 


3 4 

where  B = E(y  - u)  and  G - E(y  - u ) are  the  third  and  fourth  central 
n n 


n.oments  of  the  distribution  (assumed  stationary),  respectively.  For  the 
Gaussian  case,  these  may  be  simplified: 


2 2 2 4 

£(<7*  - a ) ~co 


(A- 16) 


„ .—2  2 2 _ . 4 2 2 

E(<Tao-^)  e(-a  + O V ) 


(A  - 1 7 ) 


-2  22  f n 4 

E(Ow  - a ) - €<T 


(A -18) 


For  small  £ (i.  e.  , large  N)  the  formulae  (A-16)  to  (A-18)  are 


quite  accurate.  A comparison  between  the  values  computed  by  (A-16)  and 


(A-18)  with  the  exact  values  for  the  Gaussian  case  appear  in  Figure  A-2. 

—2  2 

It  follows  from  equations  (A-13)  through  (A-15)  that  a and  CT  have  compara- 
...  -2 


ble^behavior  and  are  always  better  than  O . In  general,  for  small  values  of 


y 2 2 

— all  three  will  give  similar  results;  however,  for  n <<  a , the  variance 

? 2 J _2  _2 
of  (7 qo  and  (7  ^ will  be  of  the  same  order,  whereas  that  of  cr^  will  be  much 


larger.  This  explains  the  erratic  behavior  observed  in  Figure  A-l. 
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FIGURE  A- 2 

GRAPHS  OF  THE  STANDARD  DEVIATION  OF  THE  ESTIMATED 
VARIANCE  VERSUS  THE  RATIO  OF  MEAN  TO  STANDARD  DEVIATION 
FOR  GAUSSIAN  DISTRIBUTIONS  FOR  (a)  c = 0.  2 AND  (b)e=  0.  02. 
THE  SCUD  LINES  REPRESENT  THE  TRUE  VALUE  AND  THE  O'S 
REPRESENT  THE  APPROXIMATIONS  OF  (A16)  TO(A18) 
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